601 research outputs found

    Which gene did you mean?

    Computational biology needs computer-readable information records. Increasingly, meta-analysed and pre-digested information is being used in the follow-up of high-throughput experiments and other investigations that yield massive data sets. Semantic enrichment of plain text is crucial for computer-aided analysis. In general, people think of semantic tagging as just another form of text mining, a term that has a rather negative connotation for some biologists who have been disappointed by classical text-mining approaches. Efforts so far have tried to develop tools and technologies that retrospectively extract the correct information from text, which is usually full of ambiguities. Although remarkable results have been obtained under experimental conditions, the widespread use of information mining tools is lagging behind earlier expectations. This commentary proposes making semantic tagging an integral part of electronic publishing.

    The VODAN IN: support of a FAIR-based infrastructure for COVID-19

    Molecular Technology and Informatics for Personalised Medicine and Healt

    Towards Customizable Chart Visualizations of Tabular Data Using Knowledge Graphs

    Scientific articles are typically published as PDF documents, thus rendering the extraction and analysis of results a cumbersome, error-prone, and often manual effort. New initiatives, such as ORKG, focus on transforming the content and results of scientific articles into structured, machine-readable representations using Semantic Web technologies. In this article, we focus on tabular data of scientific articles, which provide an organized and compressed representation of information. However, chart visualizations can additionally facilitate their comprehension. We present an approach that employs a human-in-the-loop paradigm during the data acquisition phase to define additional semantics for tabular data. The additional semantics guide the creation of chart visualizations for meaningful representations of tabular data. Our approach organizes tabular data into different information groups, which are analyzed for the selection of suitable visualizations. The set of suitable visualizations serves as a user-driven selection of visual representations. Additionally, customization of visual representations provides the means for facilitating the understanding and sense-making of information.

    Broadening the Scope of Nanopublications

    In this paper, we present an approach for extending the existing concept of nanopublications (tiny entities of scientific results in RDF representation) to broaden their application range. The proposed extension uses English sentences to represent informal and underspecified scientific claims. These sentences follow a syntactic and semantic scheme that we call AIDA (Atomic, Independent, Declarative, Absolute), which provides a uniform and succinct representation of scientific assertions. Such AIDA nanopublications are compatible with the existing nanopublication concept and enjoy most of its advantages, such as information sharing, interlinking of scientific findings, and detailed attribution, while being more flexible and applicable to a much wider range of scientific results. We show that users are able to create AIDA sentences for given scientific results quickly and at high quality, and that it is feasible to automatically extract and interlink AIDA nanopublications from existing unstructured data sources. To demonstrate our approach, a web-based interface is introduced, which also exemplifies the use of nanopublications for non-scientific content, including meta-nanopublications that describe other nanopublications. Comment: To appear in the Proceedings of the 10th Extended Semantic Web Conference (ESWC 2013)

    Time-resolved photoelectron and photoion fragmentation spectroscopy study of 9-methyladenine and its hydrates: a contribution to the understanding of the ultrafast radiationless decay of excited DNA bases.

    The excited-state dynamics of the purine base 9-methyladenine (9Me-Ade) have been investigated by time- and energy-resolved photoelectron imaging spectroscopy and mass-selected ion spectroscopy, in both vacuum and water-cluster environments. The specific probe processes used, namely a careful monitoring of time-resolved photoelectron energy distributions and of photoion fragmentation, together with the excellent temporal resolution achieved, enable us to derive additional information on the nature of the excited states (ππ*, nπ*, πσ*, triplet) involved in the electronic relaxation of adenine. The two-step pathway we propose to account for the double exponential decay observed agrees well with recent theoretical calculations. The near-UV photophysics of 9Me-Ade is dominated by the direct excitation of the ππ* (¹Lb) state (lifetime of 100 fs), followed by internal conversion to the nπ* state (lifetime in the ps range) via a conical intersection. No evidence for the involvement of a πσ* or a triplet state was found. 9Me-Ade–(H2O)n clusters have been studied, focusing on the fragmentation of these species after the probe process. A careful analysis of the fragments allowed us to provide evidence for a double exponential decay profile for the hydrates. The very weak second component observed, however, led us to conclude that the photophysics is very different from that of the isolated base; we assign this to a competition between (i) a direct one-step decay of the initially excited state (ππ* La and/or Lb, stabilised by hydration) to the ground state and (ii) a modified two-step decay scheme, qualitatively comparable to that occurring in the isolated molecule.

    Provenance-Centered Dataset of Drug-Drug Interactions

    Over the years, several studies have demonstrated the ability to identify potential drug-drug interactions via data mining from the literature (MEDLINE), electronic health records, public databases (Drugbank), etc. While each of these approaches is properly statistically validated, they do not take the overlap between them into consideration as one of their decision-making variables. In this paper we present LInked Drug-Drug Interactions (LIDDI), a public nanopublication-based RDF dataset with trusty URIs that encompasses some of the most cited prediction methods and sources, to provide researchers with a resource for leveraging the work of others in their prediction methods. Because one of the main obstacles to using external resources is the mapping between the drug names and identifiers they use, we also provide the set of mappings we curated in order to compare the multiple sources we aggregate in our dataset. Comment: In Proceedings of the 14th International Semantic Web Conference (ISWC) 201

    Cloudy, increasingly FAIR; Revisiting the FAIR Data guiding principles for the European Open Science Cloud

    The FAIR Data Principles propose that all scholarly output should be Findable, Accessible, Interoperable, and Reusable. As a set of guiding principles, expressing only the kinds of behaviours that researchers should expect from contemporary data resources, how the FAIR principles should manifest in reality was largely open to interpretation. As support for the Principles has spread, so has the breadth of these interpretations. In observing this creeping spread of interpretation, several of the original authors felt it was now appropriate to revisit the Principles, to clarify both what FAIRness is, and is not.

    Fluoxetine effects assessment on the life cycle of aquatic invertebrates

    Fluoxetine is a serotonin re-uptake inhibitor, generally used as an antidepressant. It is suspected to provoke substantial effects in the aquatic environment. This study reports the effects of fluoxetine on the life cycle of four invertebrate species: Daphnia magna, Hyalella azteca and the snail Potamopyrgus antipodarum, exposed to fluoxetine-spiked water, and the midge Chironomus riparius, exposed to fluoxetine-spiked sediments. For D. magna, a multi-generational study was performed in which newborns from exposed organisms were themselves exposed. Effects of fluoxetine could be found at low measured concentrations (around 10 µg l⁻¹), especially for parthenogenetic reproduction of D. magna and P. antipodarum. For daphnids, newborn length was affected by fluoxetine, and the second generation of exposed individuals showed much more pronounced effects than the first one, with a NOEC of 8.9 µg l⁻¹. For P. antipodarum, a significant decrease in reproduction was found for concentrations around 10 µg l⁻¹. In contrast, we found no effect on the reproduction of H. azteca but a significant effect on growth, which resulted in a NOEC of 33 µg l⁻¹, expressed as nominal concentration. No effect on C. riparius could be found for measured concentrations up to 59.5 mg kg⁻¹. General mechanistic energy-based models showed poor relevance for data analysis, which suggests that fluoxetine targets specific mechanisms of reproduction.

    Mining microarray datasets aided by knowledge stored in literature

    DNA microarray technology produces large amounts of data. For data mining of these datasets, background information on genes can be helpful. Unfortunately, most of this information is stored in free text. Here, we present an approach to use this information for DNA microarray data mining.